Medical Information Extracting System by Bootstrapping of NTTDRDH at NTCIR-10 MedNLP Task

نویسندگان

  • Yuji Nomura
  • Takashi Suenaga
  • Daisuke Satoh
  • Megumi Ohki
  • Toru Takaki
چکیده

We participated in a complaint and diagnosis task of MedNLP in NTCIR10. We extracted words of complaint/diagnosis by using a hybrid approach with bootstrapping and pattern matching with a medical term dictionary. It was possible that part of the complaint’s or diagnosis’s expressions are present in the extracted words. Therefore, our system concatenated the extracted words and their surrounding words by heuristic rules and determined the final complaint’s or diagnosis’s words. And our system estimated the modality attribute of the extracted complaint/diagnosis by heuristic rules also.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Overview of the NTCIR-10 MedNLP Task

Recently, medical records are increasingly written on electronic media instead of on paper, thereby increasing the importance of information processing in medical fields. We have organized an NTCIR-10 pilot task for medical records. Our pilot task, MedNLP, comprises three tasks: (1) de-identification, (2) complaint and diagnosis, and (3) free. These tasks represent elemental technologies used t...

متن کامل

UT-FX at NTCIR-10 MedNLP: Incorporating Medical Knowledge to Enhance Medical Information Extraction

The UT-FX team participated in the de-identification subtask and the complaint and diagnosis subtask of the NTCIR10 MedNLP pilot task. This report describes our approach to solving the two subtasks.

متن کامل

Overview of the NTCIR-11 MedNLP-2 Task

Electronic medical records are now often replacing paper documents, and thus the importance of information processing in medical fields has increased. We have already organized the NTCIR-10 MedNLP pilot task. It has been the very first shared task attempt to evaluate technologies to retrieve important information from medical reports written in Japanese, whereas the NTCIR-11 MedNLP-2 task has b...

متن کامل

Clinical Entity Recognition Using Cost-Sensitive Structured Perceptron for NTCIR-10 MedNLP

This paper reports on our approach to the NTCIR-10 MedNLP task, which aims at identifying personal and medical information in Japanese clinical texts. We applied a machine learning (ML) algorithm for sequential labeling, specifically, structured perceptron, and defined a cost function for lowering misclassification cost. On the test set provided by the organizers, our approach achieved an F-sco...

متن کامل

NTCIR-10 MedNLP Task Baseline System

Natural language processing (NLP) technology that handles clinical, medical and health records has been drawn much attention, because such kinds of records potentially could be rich clinical resources. This paper describes an NLP system that extracts two kinds of information from clinical documents in Japanese, which was developed as a baseline system in the NTCIR-10 MedNLP Pilot Task. Since ou...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013